MGD-GAN: Text-to-Pedestrian Generation Through Multi-grained Discrimination
نویسندگان
چکیده
In this paper, we investigate the problem of text-to-pedestrian synthesis, which has many potential applications in art, design, and video surveillance. Existing methods for text-to-bird/flower synthesis are still far from solving fine-grained image generation problem, due to complex structure heterogeneous appearance that pedestrians naturally take on. To end, propose Multi-Grained Discrimination enhanced Generative Adversarial Network, capitalizes a human-part-based Discriminator (HPD) self-cross-attended (SCA) global order capture coherence body structure. A fined-grained word-level attention mechanism is employed HPD module enforce diversified vivid details. addition, two pedestrian metrics, named Pose Score Variance, devised evaluate quality diversity. We conduct extensive experiments ablation studies on caption-annotated dataset, CUHK Person Description Dataset. The substantial improvement over various metrics demonstrates efficacy MGD-GAN scenario.
منابع مشابه
Transformation enhanced multi-grained modeling for text-independent speaker recognition
We describe our formulation of transformation enhanced data modeling used to develop a multi-grained data analysis approach to text independent speaker recognition. The broad goal is to address difficulties caused by sparse training and test data. First, our development of maximum likelihood transformation based recognition with diagonally constrained Gaussian mixture models is detailed. We giv...
متن کاملMulti-Lingual Text Generation and the Meaning-Text Theory
We describe multi-lingual text generation as an alternative to automatic translation in specified technical sublanguages, illustrating the notion with the implemented RAREAS-2 system for synthesizing marine weather forecasts in English and French. We then review the Meaning-Text Theory (MTT) of Mel'cuk et al. as we have applied it to text generation in the GOSSIP system for producing English re...
متن کاملAttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks
In this paper, we propose an Attentional Generative Adversarial Network (AttnGAN) that allows attention-driven, multi-stage refinement for fine-grained text-to-image generation. With a novel attentional generative network, the AttnGAN can synthesize fine-grained details at different subregions of the image by paying attentions to the relevant words in the natural language description. In additi...
متن کاملEnabling Interface Validation through Text Generation
To obtain the information encoded in software it is necessary to master both the implementation languages and the tools. This is not only a problem for managers and user groups who have a claim in the outcome but not the necessary training in software development to decode the implementation our case study shows that it is also a problem for the software developers. In contrast, text can be und...
متن کاملConciseness through Aggregation in Text Generation
Aggregating different pieces of similar information is necessary to generate concise and easy to understand reports in technical domains. This paper presents a general algorithm that combines similar messages in order to generate one or more coherent sentences for them. The process is not as trivial as might be expected. Problems encountered are briefly described.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-88007-1_54